Paragraph retrieval for why-question answering Exploiting discourse structure for intelligent paragraph retrieval for why-QA
نویسنده
چکیده
Finding answers to why-questions involves finding arguments in texts, rather than the noun phrases that are typical targets for factoid questions. Detecting arguments requires detecting specific rhetorical structures and relations. Therefore, we proposed the use of Rhetorical Structure Theory (RST) as a tool for discovering answer to why-questions in paragraphs that are likely to contain the answer. We evaluated this method using two sets of why-questions: one obtained by elicitation of native speakers and one containing questions that are asked to the online question answering system answers.com. Our procedure was able to find answers to about 60% of the why-questions. We conclude that some relation types have a high predictive power in answer selection, but we also found that many questions require a full paragraph for an answer. Therefore, we need to shift the research emphasis towards passage retrieval. We propose a three-step method for retrieving passages that are likely to contain the answer to a why-question: (1) query creation, (2) document retrieval and (3) paragraph retrieval and ranking. Standard information retrieval models are not suitable for ranking paragraphs as candidate answers. One issue is the small size of the text units that must be ranked. In addition, we need to incorporate information on the presence of RST relations in the language model used for ranking.
منابع مشابه
Exploiting discourse structure for intelligent paragraph retrieval for why-QA
Finding answers to why-questions involves finding arguments in texts, rather than the noun phrases that are typical targets for factoid questions. Detecting arguments requires detecting specific rhetorical structures and relations. Therefore, we proposed the use of Rhetorical Structure Theory (RST) as a tool for discovering answer to why-questions in paragraphs that are likely to contain the an...
متن کاملEvaluating Answer Extraction for Why-QA using RST-annotated Wikipedia texts
In this paper the research focus is on the task of answer extraction for why-questions. As opposed to techniques for factoid QA, finding answers to whyquestions involves exploiting text structure. Therefore, we approach the answer extraction problem as a discourse analysis task, using Rhetorical Structure Theory (RST) as framework. We evaluated this method using a set of why-questions that have...
متن کاملEvaluating paragraph retrieval for why-QA
We implemented a baseline approach to why-question answering based on paragraph retrieval. Our implementation incorporates the QAP ranking algorithm with addition of a number of surface features (cue words and XML markup). With this baseline system, we obtain an accuracy-at-10 of 57.0% with an MRR of 0.31. Both the baseline and the proposed evaluation method are good starting points for the cur...
متن کاملBoosting Passage Retrieval through Reuse in Question Answering
Question Answering (QA) is an emerging important field in Information Retrieval. In a QA system the archive of previous questions asked from the system makes a collection full of useful factual nuggets. This paper makes an initial attempt to investigate the reuse of facts contained in the archive of previous questions to help and gain performance in answering future related factoid questions. I...
متن کاملLegal Question Answering using Ranking SVM and Deep Convolutional Neural Network
This paper presents a study of employing Ranking SVM and Convolutional Neural Network for two missions: legal information retrieval and question answering in the Competition on Legal Information Extraction/Entailment. For the first task, our proposed model used a triple of features (LSI, Manhattan, Jaccard), and is based on paragraph level instead of article level as in previous studies. In fac...
متن کامل